Picture for Yaowei Wang

Yaowei Wang

Seeing Through the Chain: Mitigate Hallucination in Multimodal Reasoning Models via CoT Compression and Contrastive Preference Optimization

Add code
Feb 03, 2026
Viaarxiv icon

CLIP-Guided Adaptable Self-Supervised Learning for Human-Centric Visual Tasks

Add code
Jan 19, 2026
Viaarxiv icon

Splatwizard: A Benchmark Toolkit for 3D Gaussian Splatting Compression

Add code
Dec 31, 2025
Viaarxiv icon

A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis

Add code
Dec 16, 2025
Figure 1 for A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
Figure 2 for A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
Figure 3 for A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
Figure 4 for A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
Viaarxiv icon

Training-free Context-adaptive Attention for Efficient Long Context Modeling

Add code
Dec 10, 2025
Figure 1 for Training-free Context-adaptive Attention for Efficient Long Context Modeling
Figure 2 for Training-free Context-adaptive Attention for Efficient Long Context Modeling
Figure 3 for Training-free Context-adaptive Attention for Efficient Long Context Modeling
Figure 4 for Training-free Context-adaptive Attention for Efficient Long Context Modeling
Viaarxiv icon

Did Models Sufficient Learn? Attribution-Guided Training via Subset-Selected Counterfactual Augmentation

Add code
Nov 15, 2025
Viaarxiv icon

LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation

Add code
Nov 09, 2025
Figure 1 for LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Figure 2 for LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Figure 3 for LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Figure 4 for LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Viaarxiv icon

ESTR-CoT: Towards Explainable and Accurate Event Stream based Scene Text Recognition with Chain-of-Thought Reasoning

Add code
Jul 02, 2025
Viaarxiv icon

NN-Former: Rethinking Graph Structure in Neural Architecture Representation

Add code
Jul 01, 2025
Figure 1 for NN-Former: Rethinking Graph Structure in Neural Architecture Representation
Figure 2 for NN-Former: Rethinking Graph Structure in Neural Architecture Representation
Figure 3 for NN-Former: Rethinking Graph Structure in Neural Architecture Representation
Viaarxiv icon

RadioDUN: A Physics-Inspired Deep Unfolding Network for Radio Map Estimation

Add code
Jun 10, 2025
Figure 1 for RadioDUN: A Physics-Inspired Deep Unfolding Network for Radio Map Estimation
Figure 2 for RadioDUN: A Physics-Inspired Deep Unfolding Network for Radio Map Estimation
Figure 3 for RadioDUN: A Physics-Inspired Deep Unfolding Network for Radio Map Estimation
Figure 4 for RadioDUN: A Physics-Inspired Deep Unfolding Network for Radio Map Estimation
Viaarxiv icon